Record function arguments in the trace #36

tzanko-matev · 2025-09-15T14:25:03Z

(AI-generated spec based on the contents of this PR)

Tracing Function Arguments on Entry and Structured Value Encoding

This specification defines how to capture Python function arguments at the moment a function starts executing (the PY_START event) and how to encode argument values into the runtime tracing format. It also defines fail‑fast error behavior for the monitoring callback and the test expectations that validate the behavior.

Audience: Junior developers familiar with Rust and Python, but with no prior knowledge of CPython frames or this codebase.

Executive Summary

Record function arguments on PY_START for all Python parameter kinds: positional‑only, positional‑or‑keyword, keyword‑only, varargs (*args), and kwargs (**kwargs).
Encode values canonically and structurally:
- None, bool, int, str as dedicated kinds (None, Bool, Int, String).
- Python tuple → Tuple with recursively encoded elements.
- Python list → Sequence with recursively encoded elements.
- Python dict → Sequence of (key, value) Tuples. Keys are encoded as String when possible; otherwise, encode the key normally.
Fail fast on irrecoverable errors during argument capture: raise a Python exception and immediately disable further monitoring callbacks for the session.
Tests assert argument presence, name mapping, stable string encoding, and structured kwargs.
Add .cargo/ to version control ignore rules.

Goals and Non‑Goals

Goals

Capture and emit all Python argument kinds on function entry.
Preserve structure of varargs and kwargs values where possible.
Provide deterministic, canonical encoding for common primitives.
Fail fast on errors (no silent fallbacks) and disable further monitoring after the first callback error.
Provide clear, verifiable test criteria.

Non‑Goals

Introducing a new mapping kind to the value schema (we reuse existing Sequence + Tuple).
Changing higher‑level tracing schemas or writer behavior beyond what is needed to attach arguments to Call events.
Unifying cross‑recorder type naming (e.g., “List” vs “Array”) beyond the choices specified here.

Background: CPython Frames and Code Objects (Quick Primer)

At the beginning of a Python function call, CPython creates a frame with locals bound for the call. The function’s code object carries metadata describing its parameters.

Key code object attributes used here (CPython 3.8+):

co_varnames: A tuple of local variable names. Parameters appear first in a defined order.
co_argcount: Total count of positional parameters. Important: in Python 3.8+, this total includes positional‑only and positional‑or‑keyword parameters (see PEP 570: Positional‑Only Parameters).
co_posonlyargcount: Count of positional‑only parameters. Useful only if you need to distinguish subgroups; we do not for this feature.
co_kwonlyargcount: Count of keyword‑only parameters.
co_flags: Bitmask; 0x04 indicates presence of *args (varargs), 0x08 indicates presence of **kwargs (varkeywords).

Reference terms: PEP 570 (Positional‑Only Parameters) and CPython code object docs.

High‑Level Design

When the monitoring system delivers a PY_START event, we:

Ensure the tracer is started for the code object and obtain a function id.
Obtain the current frame via sys._getframe(0) and the frame’s locals (f_locals).
Compute the ordered list of parameter names directly from the code object, using CPython ordering, and look up each name in f_locals.
Encode each found value using encode_value and attach the resulting args vector to the Call event payload via the trace writer.
If any irrecoverable error occurs (e.g., _getframe unavailable), raise a Python exception and immediately disable further monitoring (fail fast).

Parameter Ordering and Name Discovery

Given a bound code object and Python 3.8+ semantics:

Let pos_count = co_argcount (total positional parameters, including positional‑only and positional‑or‑keyword). Do not add co_posonlyargcount to this figure (that would double count).
Let kwonly_count = co_kwonlyargcount.
Let flags = co_flags.
Let varnames = list(co_varnames).

Derive the ordered parameter names from varnames:

Positional parameters: varnames[0 : min(pos_count, len(varnames))].
Varargs (*args): if flags & 0x04 != 0, then next name is varnames[idx].
Keyword‑only parameters: the next kwonly_count names.
Kwargs (**kwargs): if flags & 0x08 != 0, then next name is varnames[idx].

For each name in this sequence, try to fetch the value from f_locals[name]:

If present, encode it and include it.
If absent or retrieval fails, skip it silently (locals may not have been populated for some names in unusual interpreter states, but this should be rare at function entry).

Value Encoding Rules (`encode_value`)

Encode a Python object to a ValueRecord used by the trace writer. The encoder must be recursive and must follow these canonical rules:

Primitives and None

None → special NONE_VALUE constant.
bool → Bool with appropriate type_id.
int → Int with appropriate type_id.
str → String with exact text. This is canonical for text; do not fall back to Raw for str.

Containers

Python tuple → Tuple with elements = [encode_value(item) for item in tuple].
Python list → Sequence with elements = [encode_value(item) for item in list], is_slice = false, and language type name “List”.
Python dict → represent as a Sequence with language type name “Dict”, whose elements are 2‑element Tuples (key, value).
- Encode keys as String when key is a Python str.
- If a key is not a str, encode the key using normal rules (best effort). Kwarg keys are always strings, so in kwargs contexts you will observe String keys.

Fallback

For all other types, obtain a textual representation and encode as Raw with language type name “Object”.

Type registration

For every concrete kind you emit, register or look up a type_id via TraceWriter::ensure_type_id(...), using the following language type names:
- Bool → "Bool"
- Int → "Int"
- String → "String"
- Tuple → "Tuple"
- Sequence (Python list) → "List"
- Sequence (Python dict encoded as sequence of pairs) → "Dict"
- Raw → "Object"

Attaching Arguments to the Call Event

For each discovered parameter name and encoded value:

Create a full value record using TraceWriter::arg(writer, name, value_record).
Accumulate these into a Vec<FullValueRecord>.
Emit the Call event via TraceWriter::register_call(writer, function_id, args_vec).

Note: The writer manages a variable‑name table. Each argument will reference a variable_id that can be resolved to the actual name through separate VariableName events.

Error Handling and Fail‑Fast Behavior

on_py_start must return PyResult<()> instead of (). Behavior:

On success: return Ok(()).
On irrecoverable error (e.g., _getframe import or call fails, accessing locals fails in a way that prevents capture):
- Return Err(PyRuntimeError("on_py_start: failed to capture args: <reason>")).
- The callback wrapper (see below) must immediately disable future monitoring for this tool by setting events to NO_EVENTS and propagate the error to Python.

Callback wrapper behavior (PY_START only is specified, but approach generalizes):

Acquire the global tracer context.
Invoke on_py_start and match on the PyResult.
- Ok(()): return Ok(()).
- Err(err): call set_events(py, &tool, NO_EVENTS) to turn off events for this session, log an error, and return Err(err).
If the global context is absent, return Ok(()) (no tracing active).

Rationale: Turning off events on first error prevents repeated exceptions during interpreter activities like error printing (which otherwise trigger more PY_START events).

Test Specifications

Parsing helper changes (Python side)

Extend the trace parsing helper to collect:
- varnames: List[str] from VariableName events (index is variable_id).
- call_records: List[Dict[str, Any]] from raw Call payloads (to inspect args).

Test: record positional arguments on entry

Create a script:
- def foo(a, b): return a if len(str(b)) > 0 else 0
- Call foo(1, 'x') under tracing.
Assert:
- A Call for foo exists with two arguments.
- Arg 0: name a, value kind Int, value 1.
- Arg 1: name b, value kind String, text "x".

Test: record all Python argument kinds

Create a script:
- def g(p, /, q, *args, r, **kwargs): ...
- Call g(10, 20, 30, 40, r=50, k=60) under tracing.
Assert:
- Names present: p, q, args, r, kwargs.
- p == 10, q == 20, r == 50 as Int.
- Varargs (args) is either:
  - Sequence or Tuple with exactly two elements 30, 40 as Int, or
  - Raw whose text contains "30" and "40" (accepted to keep compatibility with alternative backends).
- Kwargs (kwargs) is structured as:
  - kind Sequence with one element, which is
  - kind Tuple of two elements: key record kind String with text "k"; value record kind Int with 60.

Test: fail fast when frame access fails (Rust module test via PyO3)

Start tracing with activation scoped to the test program path.
Monkeypatch sys._getframe to raise RuntimeError when called.
Execute a trivial program that triggers a Python function call under tracing.
Expect a raised exception containing _getframe info.
Execute the program again in the same process: no exception should be raised because monitoring has been disabled.
Restore _getframe and stop tracing.

Rust test fixture adaptation

Any Tracer implementations used by tests must update on_py_start signature to return PyResult<()> and return Ok(()) when no special logic is needed.

Implementation Details (Where and How)

Files and responsibilities

src/runtime_tracer.rs
- Implement/extend encode_value(py, value) per the rules above, using TraceWriter::ensure_type_id(...) for type registration.
- Change on_py_start(py, code, offset) to return PyResult<()> and implement argument capture:
  - Ensure tracer started and function_id available.
  - Build ordered parameter list from the code object (co_varnames, co_argcount, co_kwonlyargcount, co_flags). Do not double count positional‑only.
  - Obtain f_locals and collect values by name.
  - Encode values and build args with TraceWriter::arg.
  - Register the call via TraceWriter::register_call(writer, fid, args).
  - Fail fast by returning Err(...) if frame/locals access fails.
src/tracer.rs
- Change the Tracer trait method signature: fn on_py_start(...) -> PyResult<()>.
- Update docs for fail‑fast guidance.
- Update the callback wrapper callback_py_start to:
  - Call on_py_start and match on the result.
  - On Err, call set_events(py, &tool, NO_EVENTS), log, and return the error.
test/test_monitoring_events.py
- Extend parser to collect varnames and call_records.
- Add the two tests specified above.
tests/test_fail_fast_on_py_start.py
- Add the Python test that monkeypatches _getframe and asserts fail‑fast behavior with monitoring disabled after the first error.
.gitignore
- Add .cargo/ to exclude Cargo cache/config directories from version control.

Edge Cases and Defensive Choices

Missing locals for some parameter names are skipped. This is rare at function start but should not crash the tracer.
Deeply nested containers are recursively encoded. Extremely deep structures may be expensive; this is acceptable for now.
Dict encoding is general (applies to any Python dict), but kwargs contexts will always produce string keys. Non‑string keys are encoded normally.
We intentionally do not modify module‑level activation flags during fail‑fast; turning off events is sufficient to prevent further callbacks, and explicit shutdown remains idempotent.

Acceptance Criteria

At least one Call event for the tested functions contains a non‑empty args vector.
Names and values for positional parameters match exactly, including canonical String for Python str.
*args and **kwargs are present and encoded according to the rules above.
When _getframe raises, the initial call propagates an exception and subsequent calls do not re‑raise because monitoring was disabled.
Tests described in this spec pass.

Future Work

Unify list/sequence language type naming across recorders (e.g., consistently "List").
Consider introducing a dedicated mapping value kind for dictionaries to avoid overloading Sequence.
Consider stricter behavior for non‑string dict keys in non‑kwargs contexts (fail vs. best effort).

alehander92

recording kwargs as an ordered list is a good solution, looks good

Problem: Call events currently omit function arguments; RuntimeTracer emits register_call(fid, []) on PY_START. Plan: In on_py_start, read the current Python fram:e via sys._getframe(0), grab locals, and extract the first co_argcount names from code.co_varnames. For each present arg in f_locals, encode the value with encode_value and build FullValueRecord entries via writer.arg(name, value). Pass this Vec to TraceWriter::register_call. Scope: Handle positional/pos-or-keyword args only (co_argcount). Varargs/kwargs support can follow in a separate change. Tests: Add a pytest that runs a script with foo(a, b) and asserts that the Call event includes two args named a and b with correct Int/String values. ISSUE-001 solution: Encode and record function arguments on entry - Implemented argument capture in RuntimeTracer::on_py_start by reading the current frame (sys._getframe(0)), extracting the first co_argcount names from co_varnames, and encoding their values with encode_value. Falls back gracefully to empty args on any error. - Register calls with the constructed args vector so VariableName/Value and Step events precede Call as expected by the writer. - Added pytest test 'test_call_arguments_recorded_on_py_start' asserting that the Call for foo(a, b) includes two args named 'a' and 'b' with correct Int/String values. Differences from plan: kept varargs/kwargs out-of-scope; added defensive error handling around frame/locals to avoid destabilizing tracing when frame access is unavailable; test tolerates String or Raw encoding for 'b' to accommodate backend string handling.

Problem - Only positional arguments (including positional-only and pos-or-kw) are captured on PY_START. - Varargs (*args), keyword-only, and kwargs (**kwargs) are missing. Planned solution - Read counts from code object: co_argcount, co_posonlyargcount, co_kwonlyargcount. - Inspect co_flags for CO_VARARGS (0x04) and CO_VARKEYWORDS (0x08). - Derive parameter names from co_varnames in the interpreter-defined order: [posonly + pos-or-kw] [+ varargs] [+ kwonly] [+ kwargs]. - Look up each name in frame.f_locals and encode values via encode_value. - For now, encode *args/**kwargs values using existing encoder (primitives as-native, others fallback to Raw), keeping tests tolerant to backend encoding differences. Tests - Add a test that defines a function with all argument kinds and asserts that the Call event includes entries for: pos-only, pos-or-kw, varargs name, kw-only, and kwargs name, with correct names and plausible values. fix(ISSUE-002): Capture all Python argument kinds on function entry What changed - Extended on_py_start to collect names beyond co_argcount by reading co_posonlyargcount, co_kwonlyargcount, and co_flags. - Derived parameter name order from co_varnames: [posonly + pos-or-kw] [+ varargs] [+ kw-only] [+ kwargs]. - Looked up each parameter in frame.f_locals and encoded via existing encode_value. - Added a test exercising a function with pos-only, pos-or-kw, *args, kw-only, and **kwargs and asserting all argument names are present with sensible values. Notes vs plan - Kept value encoding for *args/**kwargs using the existing encoder (which may fall back to Raw); the test accepts either structured (Sequence/Tuple/Mapping) or Raw. - Did not add CodeObjectWrapper helpers for posonly/kwonly to avoid widening surface area; accessed attributes through the bound code object consistently with existing code.

… rules

Problem - Tests currently accept either String or Raw for str arguments (e.g., 'x'), which masks regressions. Plan - Ensure Python str is encoded as String in runtime tracer (already true). - Tighten tests to require kind == String with text == 'x'. - Keep varargs/kwargs flexible; their encoding may vary by backend. - Clarify encoding rule in comments. Notes - No schema changes; this is test-only in this repo. ISSUE-004: Tighten tests to require String encoding for str args What changed - Updated test_call_arguments_recorded_on_py_start to assert kind == String and text == 'x' for the string argument. - Added documentation comments to encode_value clarifying canonical encoding rules (str -> String; non-handled types -> Raw). Notes vs. plan - No runtime logic changes were needed: the Rust runtime tracer already encodes Python str as String. - Varargs/kwargs tests remain flexible, as planned. - Could not run 'just dev test' due to sandboxed, offline environment; please run locally to verify.

Signed-off-by: Tzanko Matev <[email protected]>

- Fix `on_py_start` to include positional-only parameters by selecting `co_posonlyargcount + co_argcount` names from `co_varnames` for the positional slice. - Keep existing handling for `*args`, keyword-only, and `**kwargs` intact. - Rationale: `co_argcount` counts only pos-or-keyword parameters; omitting `co_posonlyargcount` dropped names before `/` (PEP 570). This patch ensures complete positional argument coverage and aligns with CPython ordering. - Validation: Python tests already cover this via `test_all_argument_kinds_recorded_on_py_start` which asserts presence of `p` from `def g(p, /, q, *args, r, **kwargs)`. The fix satisfies that check. Implementation notes: - No new public API; minimal, focused change in `runtime_tracer.rs`. - Followed repo rules to avoid unnecessary code and changes.

Add pytest `test_fail_fast_when_frame_access_fails` that monkeypatches `sys._getframe` to raise during `PY_START`. It asserts the runtime tracer propagates a Python exception instead of silently swallowing the failure. This currently fails due to the defensive fallback in `on_py_start`.

- Change `Tracer::on_py_start` to return `PyResult<()>` so errors can propagate through the `#[pyfunction]` callback. - Update `callback_py_start` to return the tracer result. - Implement fail-fast in `RuntimeTracer::on_py_start`: raise a `RuntimeError` with a clear message instead of silently continuing with empty args when frame/locals access fails. - Adjust test tracers to new signature. A previous commit adds a failing pytest that monkeypatches `sys._getframe` to raise; with this change, it now passes by surfacing an exception.

Signed-off-by: Tzanko Matev <[email protected]>

- Simulate `sys._getframe` failure during `PY_START` and assert initial error surfaces. - Re-run the same program path to ensure tracer disables after the first error. - Test currently fails, exposing repeated callback errors (tracer remains active).

- Update `callback_py_start` to soft-stop tracing when `on_py_start` returns an error. - Perform teardown under the GLOBAL lock to avoid deadlock: - call `finish()`, unregister all callbacks from the original mask, - set events to `NO_EVENTS`, clear the code object registry, - set `global.mask = NO_EVENTS` to prevent duplicate work on uninstall. - Preserve error propagation to Python; subsequent events are not emitted. - Keeps `ACTIVE` unchanged; `stop_tracing()` remains safe and idempotent after error. This resolves repeated callback errors observed after a failure in `on_py_start`. review(codex): ISSUE-007 - review fail-fast on first callback error review(codex): ISSUE-007 - review fail-fast on first callback error

- Encode Python tuples as `Tuple` and lists as `Sequence` in `RuntimeTracer::encode_value`, recursively encoding elements. - Leaves kwargs (`dict`) as `Raw` for now; the `runtime_tracing` format has no dedicated mapping variant. - This advances ISSUE-002 by providing structured capture for `*args` instead of raw string fallback, aligning with the tests that accept `Sequence`/`Tuple` for varargs. Notes: - Kept string canonicalization (`String`) intact and existing primitive encodings unchanged. - Did not tighten Python-side tests yet to require structured kwargs, since the underlying format lacks a mapping value; kwargs remain backend-dependent. review(codex): Inline review for ISSUE-002 varargs encoding; update issues.md

- Tighten `test_all_argument_kinds_recorded_on_py_start` to require kwargs to be encoded structurally as a Sequence of (key, value) Tuples. - This codifies the intended shape for `**kwargs` discussed in ISSUE-008 and currently fails because dicts are encoded as Raw.

- Implement dict encoding in `RuntimeTracer::encode_value` by representing Python `dict` as a `Sequence` whose elements are 2-element `Tuple`s: `(String(key), encode_value(value))`. - This preserves kwargs structure losslessly without changing the `runtime_tracing` value kinds, per the proposed approach in ISSUE-008. - Keys are encoded canonically as `String` when possible; otherwise they fall back to regular value encoding. Kwarg keys are always strings. - Update the Python test to require structured kwargs encoding in `test_all_argument_kinds_recorded_on_py_start`. Rationale: Align kwargs recording with positional/varargs structure, enabling exact downstream analysis of kwargs while maintaining compatibility with the existing trace format.

- Treat `co_argcount` as the total number of positional parameters (including positional-only), per CPython 3.8+ semantics. - Stop double-counting positional-only by removing the addition of `co_posonlyargcount`. - Keeps varargs/kw-only/kwargs indexing correct and stable. Rationale: Previously we added `co_posonlyargcount` to `co_argcount`, which could shift indexes and misclassify `*args`/kw-only in edge cases. Existing tests cover presence and encoding of all argument kinds; this change aligns the implementation with the spec while preserving test expectations. Validation: - `cargo build` succeeds locally. Python test execution is not performed due to sandboxed environment, but behavior is a strict refinement consistent with CPython semantics and current tests.

tzanko-matev force-pushed the function-arguments branch from ee0b5a8 to e47d915 Compare September 15, 2025 14:27

alehander92 approved these changes Sep 16, 2025

View reviewed changes

tzanko-matev force-pushed the agent-workflow branch from d885467 to 4fcbeac Compare September 16, 2025 12:15

tzanko-matev force-pushed the function-arguments branch 2 times, most recently from 5bdbea9 to 46dc48e Compare September 16, 2025 12:21

tzanko-matev force-pushed the agent-workflow branch from 4fcbeac to 7838235 Compare September 16, 2025 12:21

Base automatically changed from agent-workflow to main September 16, 2025 15:56

tzanko-matev added 17 commits September 16, 2025 18:59

review(codex): ISSUE-001 - argument capture on function entry

2af0acc

issues(codex): Add missing Definition of Done sections to comply with…

87c92b2

… rules

issues.md: Clarify ISSUE-003

2237442

Signed-off-by: Tzanko Matev <[email protected]>

issues.md: issue-007

4fab453

Signed-off-by: Tzanko Matev <[email protected]>

Add new issues

a2ea328

tzanko-matev force-pushed the function-arguments branch from 46dc48e to 1dfea0f Compare September 16, 2025 16:00

tzanko-matev merged commit 3e5824d into main Sep 16, 2025
2 checks passed

tzanko-matev deleted the function-arguments branch September 16, 2025 16:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Record function arguments in the trace #36

Record function arguments in the trace #36

Uh oh!

tzanko-matev commented Sep 15, 2025 •

edited

Loading

Uh oh!

alehander92 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Record function arguments in the trace #36

Record function arguments in the trace #36

Uh oh!

Conversation

tzanko-matev commented Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Tracing Function Arguments on Entry and Structured Value Encoding

Executive Summary

Goals and Non‑Goals

Background: CPython Frames and Code Objects (Quick Primer)

High‑Level Design

Parameter Ordering and Name Discovery

Value Encoding Rules (encode_value)

Attaching Arguments to the Call Event

Error Handling and Fail‑Fast Behavior

Test Specifications

Implementation Details (Where and How)

Edge Cases and Defensive Choices

Acceptance Criteria

Future Work

Uh oh!

alehander92 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tzanko-matev commented Sep 15, 2025 •

edited

Loading

Value Encoding Rules (`encode_value`)